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DETAILED ACTION 

Continued Examination Under 37 CFR 1.114 

1 . A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1 .17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1 .17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 
03/03/09 has been entered. 

Claim Rejections - 35 USC § 102 

2. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 1 02 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

3. Claims 47 - 52, 57 - 59 are rejected under 35 U.S.C. 1 02(b) as being anticipated 
by Palmer et al., (Chinese Word Segmentation and Information Retrieval; 1997). 

As per claims 47, and 57, Palmer et al., teach a computer-implemented method 
for evaluating a word segmentation language model, comprising: 

building the word segmentation language model ("segmentation based on 
character bigrams"; page 175, col.2, paragraph 1; page 177, col. 2, paragraph 5); 
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utilizing a computer processor that is a functional component of the computer to 
apply the language model to a test corpus of unsegmented text different from the 
annotated corpus so as to provide an output indicative of words in the test corpus and a 
word type indication for each word, the word type indication being any one of a plurality 
of word type indications ("refinement steps attempt to recognize idiomatic 
expressions, derived words, Chinese person names, and foreign proper names" 
those are considered as word type indications; pages 177, col.2, paragraph 3); 

utilizing the processor to compare the word type indication for each word in the 
output of the language model with predefined word type indications of words of the test 
corpus; and utilizing the processor to generate a quantitative value that represents a 
level of precision with which word type indications were applied in the output indicative 
of words in the test corpus, wherein generating comprises generating based on a 
comparison of the word type indication for words in the output to the predefined word 
type indications ("precision is defined as the percentage of identified words which are 
also in the same positions in the hand-segmented text... refinement steps attempt to 
recognize idiomatic expressions, derived words, Chinese person names, and 
foreign proper names, determine the contribution of each of these steps to the 
segmentation accuracy"; page 176, col.2, paragraph 5; pages 177, col.2, paragraph 3). 

As per claim 48, Palmer et al., further disclose building the word segmentation 
language model comprises building the word segmentation language model based on 
an annotated corpus ("segmentation based on character bigrams"; page 175, col.2, 
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As per claims 49, and 58, Palmer et al., further disclose generating the 
quantitative value further comprises generating a quantitative value based on a 
comparison of word type indications of words in the output that match predefined word 
type indications assigned to the same words in the test corpus indications ("precision is 
defined as the percentage of identified words which are also in the same positions in the 
hand-segmented text"; page 176, col. 2, paragraph 5). 

As per claim 50, Palmer et al., further disclose generating the quantitative value 
comprises generating a quantitative value that is indicative of how frequently a word 
type indication in the output matches a corresponding predefined word type indication in 
the test corpus ("use frequency-based phrase building, that is, segmentation based 
on character n-gram occurrences in the collection"; page 177, col .2, paragraph 5). 

As per claim 51 , Palmer et al., further disclose generating the quantitative value 
comprises generating a quantitative value that is indicative of how frequently a word 
type indication, assigned to a word in the output, matches a predefined word type 
indication assigned to a same word in the test corpus ("use frequency-based phrase 
building, that is, segmentation based on character n-gram occurrences in the 
collection"; page 177, col. 2, paragraph 5). 
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As per claims 52, and 59, Palmer et al., further disclose generating a quantitative 
value further comprises generating a quantitative value that represents a level of 
precision with which person name word type indications were applied in the output 
("Chinese person names"; page 177, col.2, paragraph 3). 

Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

5. Claims 53, 54, 60, and 61 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Palmer et al., (Chinese Word Segmentation and Information 
Retrieval; 1997) in view of Guo et al., (US PAP 2002/0052901). 

As per claims 53, 54, 60, and 61 .Palmer et al., do not specifically teach that 
location name type indications were applied in the output; organization name word type 
indications were applied in the output. 

Guo et al., teach that for Chinese language, the morphologic process 
includes the steps of: (1) segmenting sentences into words according to the 
system dictionary and the user-defined dictionaries; (2) identifying proper 
names (currently including person names, place names and person titles), domain 
terms, numbers, measure words, and date expressions (paragraph 29). 
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Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made use place names and domain terms in precision scores 
as taught by Guo et al., in Palmer et al., because that would contribute to the 
segmentation accuracy (Palmer et al., page 177, col. 2, paragraph 3). 

6. Claims 55, 56, and 62 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Palmer et al., (Chinese Word Segmentation and Information 
Retrieval; 1997) in view of Haizhou et al., (Chinese Word Segmentation, 1998). 

As per claims 55, 56, and 62, Palmer et al., do not specifically teach that 
overlapping ambiguous string word type indications were applied in the output; 
organization name word type indications were applied in the output; covering 
ambiguous string word type indications were applied in the output. 

Haizhou et al., teach As indicated by Liang[2], there are two cases of unexpected 
segmentation. One is overlapping ambiguity where a character could go either way to 
form two words, such as in example 1 . Another is composition ambiguity where the 
subsegmentation is possible:... One can find that , and all are possible word entries, 
thus both results are valid based on lexicon entries (page 215, section 3.2, paragraph 
3). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made use overlapping ambiguous string word and covering 
ambiguous string word in precision scores as taught by Haizhou et al., in Palmer et al., 
because that would contribute to the segmentation accuracy (Palmer et al., page 177, 
col. 2, paragraph 3). 
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7. Claims 63 64, and 66 are rejected under 35 U.S.C. 1 03(a) as being unpatentable 
over Palmer et al., (Chinese Word Segmentation and Information Retrieval; 1997) in 
view of Dien et al., (Vietnamese Word Segmentation, 2001). 

As per claim 63, Palmer et al., teach a computer-implemented method for 
performing word segmentation, the method comprising: 

receiving an input of text; utilizing a computer processor that is a functional 
component of the computer to apply a language model so as to determine a 
segmentation of the text ("segmentation based on character bigrams"; page 175, col. 2, 
paragraph 1; page 177, col.2, paragraph 5); 

identifying a morphologically derived word within the text; and providing an output 
that includes the segmentation of the text ("derived words... to the segmentation 
accuracy"; page 177, col.2, paragraph 3). 

However, Palmer et al., do not specifically teach an indication of a combination of 
parts that form the morphologically derived word. 

Dien et al., teach that those are the words morphologically derived 
(R.Sproat,1996), e.g.: coa gaeng (attempt, v) soT coa gaeng (attempt, n) hiean ha'i'i 
(modern ,a ) hiean ha'i'i houa (modernize) chuu toch (presidents) phou chuu toch (vice- 
president) [section 4.1 .5]. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made use indication of a combination of parts that form derived 
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words as taught by Dien et al., in Palmer et al., because that would contribute to the 
segmentation accuracy (Palmer et al., page 177, col.2, paragraph 3). 

As per claim 64, Palmer et al., further disclose the output also includes an 
indication of a named entity detected within the text ("Chinese person names"; page 
177, col.2, paragraph 3). 

As per claim 66, Palmer et al., further disclose the output also includes an 
indication a part of speech represented by the combination of parts [section 4.1 .5]. 

8. Claim 65 is rejected under 35 U.S.C. 103(a) as being unpatentable over Palmer 
et al., (Chinese Word Segmentation and Information Retrieval; 1997) in view of Dien et 
al., (Vietnamese Word Segmentation, 2001), and further in view of Guo et al., (US PAP 
2002/0052901). 

As per claim 65, Palmer et al., in view of Dien et al., do not specifically teach the 
output also includes an indication of a factoid detected within the text. 

Guo et al., teach that for Chinese language, the morphologic process 
includes the steps of: (1) segmenting sentences into words according to the 
system dictionary and the user-defined dictionaries; (2) identifying proper 
names (currently including person names, place names and person titles), domain 
terms, numbers, measure words, and date expressions (paragraph 29). 
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Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to identify factoids as taught by Guo et al., in Palmer et 
al., because that would contribute to the segmentation accuracy (Palmer et al., page 
177, col. 2, paragraph 3). 

Conclusion 

9. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to LEONARD SAINT CYR whose telephone number is 
(571) 272-4247. The examiner can normally be reached on Mon- Friday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571) 272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is (571)- 
273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
LS 

05/13/09 
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